Overview

Brought to you by YData

Dataset statistics

Number of variables19
Number of observations21613
Missing cells0
Missing cells (%)0.0%
Duplicate rows183
Duplicate rows (%)0.8%
Total size in memory3.1 MiB
Average record size in memory152.0 B

Variable types

Numeric15
Categorical4

Alerts

Dataset has 183 (0.8%) duplicate rowsDuplicates
bathrooms is highly overall correlated with bedrooms and 6 other fieldsHigh correlation
bedrooms is highly overall correlated with bathrooms and 2 other fieldsHigh correlation
floors is highly overall correlated with bathrooms and 3 other fieldsHigh correlation
grade is highly overall correlated with bathrooms and 6 other fieldsHigh correlation
long is highly overall correlated with zipcodeHigh correlation
price_gt_1M is highly overall correlated with grade and 1 other fieldsHigh correlation
sqft_above is highly overall correlated with bathrooms and 5 other fieldsHigh correlation
sqft_living is highly overall correlated with bathrooms and 5 other fieldsHigh correlation
sqft_living15 is highly overall correlated with bathrooms and 3 other fieldsHigh correlation
sqft_lot is highly overall correlated with sqft_lot15High correlation
sqft_lot15 is highly overall correlated with sqft_lotHigh correlation
view is highly overall correlated with waterfrontHigh correlation
waterfront is highly overall correlated with viewHigh correlation
yr_built is highly overall correlated with bathrooms and 2 other fieldsHigh correlation
zipcode is highly overall correlated with longHigh correlation
waterfront is highly imbalanced (93.6%) Imbalance
view is highly imbalanced (72.2%) Imbalance
price_gt_1M is highly imbalanced (63.8%) Imbalance
sqft_basement has 13126 (60.7%) zeros Zeros
yr_renovated has 20699 (95.8%) zeros Zeros

Reproduction

Analysis started2025-06-01 01:00:05.532742
Analysis finished2025-06-01 01:02:16.670918
Duration2 minutes and 11.14 seconds
Software versionydata-profiling vv4.16.1
Download configurationconfig.json

Variables

bedrooms
Real number (ℝ)

High correlation 

Distinct13
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.3708416
Minimum0
Maximum33
Zeros13
Zeros (%)0.1%
Negative0
Negative (%)0.0%
Memory size169.0 KiB
2025-05-31T21:02:17.045592image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile2
Q13
median3
Q34
95-th percentile5
Maximum33
Range33
Interquartile range (IQR)1

Descriptive statistics

Standard deviation0.93006183
Coefficient of variation (CV)0.27591383
Kurtosis49.063653
Mean3.3708416
Median Absolute Deviation (MAD)1
Skewness1.9742995
Sum72854
Variance0.86501501
MonotonicityNot monotonic
2025-05-31T21:02:17.529869image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=13)
ValueCountFrequency (%)
3 9824
45.5%
4 6882
31.8%
2 2760
 
12.8%
5 1601
 
7.4%
6 272
 
1.3%
1 199
 
0.9%
7 38
 
0.2%
0 13
 
0.1%
8 13
 
0.1%
9 6
 
< 0.1%
Other values (3) 5
 
< 0.1%
ValueCountFrequency (%)
0 13
 
0.1%
1 199
 
0.9%
2 2760
 
12.8%
3 9824
45.5%
4 6882
31.8%
5 1601
 
7.4%
6 272
 
1.3%
7 38
 
0.2%
8 13
 
0.1%
9 6
 
< 0.1%
ValueCountFrequency (%)
33 1
 
< 0.1%
11 1
 
< 0.1%
10 3
 
< 0.1%
9 6
 
< 0.1%
8 13
 
0.1%
7 38
 
0.2%
6 272
 
1.3%
5 1601
 
7.4%
4 6882
31.8%
3 9824
45.5%

bathrooms
Real number (ℝ)

High correlation 

Distinct30
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.1147573
Minimum0
Maximum8
Zeros10
Zeros (%)< 0.1%
Negative0
Negative (%)0.0%
Memory size169.0 KiB
2025-05-31T21:02:17.943856image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile1
Q11.75
median2.25
Q32.5
95-th percentile3.5
Maximum8
Range8
Interquartile range (IQR)0.75

Descriptive statistics

Standard deviation0.77016316
Coefficient of variation (CV)0.36418512
Kurtosis1.2799024
Mean2.1147573
Median Absolute Deviation (MAD)0.5
Skewness0.51110757
Sum45706.25
Variance0.59315129
MonotonicityNot monotonic
2025-05-31T21:02:18.462971image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=30)
ValueCountFrequency (%)
2.5 5380
24.9%
1 3852
17.8%
1.75 3048
14.1%
2.25 2047
 
9.5%
2 1930
 
8.9%
1.5 1446
 
6.7%
2.75 1185
 
5.5%
3 753
 
3.5%
3.5 731
 
3.4%
3.25 589
 
2.7%
Other values (20) 652
 
3.0%
ValueCountFrequency (%)
0 10
 
< 0.1%
0.5 4
 
< 0.1%
0.75 72
 
0.3%
1 3852
17.8%
1.25 9
 
< 0.1%
1.5 1446
 
6.7%
1.75 3048
14.1%
2 1930
 
8.9%
2.25 2047
 
9.5%
2.5 5380
24.9%
ValueCountFrequency (%)
8 2
 
< 0.1%
7.75 1
 
< 0.1%
7.5 1
 
< 0.1%
6.75 2
 
< 0.1%
6.5 2
 
< 0.1%
6.25 2
 
< 0.1%
6 6
< 0.1%
5.75 4
 
< 0.1%
5.5 10
< 0.1%
5.25 13
0.1%

sqft_living
Real number (ℝ)

High correlation 

Distinct1038
Distinct (%)4.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2079.8997
Minimum290
Maximum13540
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size169.0 KiB
2025-05-31T21:02:19.276008image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum290
5-th percentile940
Q11427
median1910
Q32550
95-th percentile3760
Maximum13540
Range13250
Interquartile range (IQR)1123

Descriptive statistics

Standard deviation918.4409
Coefficient of variation (CV)0.44157941
Kurtosis5.243093
Mean2079.8997
Median Absolute Deviation (MAD)540
Skewness1.4715554
Sum44952873
Variance843533.68
MonotonicityNot monotonic
2025-05-31T21:02:19.956471image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1300 138
 
0.6%
1400 135
 
0.6%
1440 133
 
0.6%
1800 129
 
0.6%
1660 129
 
0.6%
1010 129
 
0.6%
1820 128
 
0.6%
1480 125
 
0.6%
1720 125
 
0.6%
1540 124
 
0.6%
Other values (1028) 20318
94.0%
ValueCountFrequency (%)
290 1
< 0.1%
370 1
< 0.1%
380 1
< 0.1%
384 1
< 0.1%
390 2
< 0.1%
410 1
< 0.1%
420 2
< 0.1%
430 1
< 0.1%
440 1
< 0.1%
460 1
< 0.1%
ValueCountFrequency (%)
13540 1
< 0.1%
12050 1
< 0.1%
10040 1
< 0.1%
9890 1
< 0.1%
9640 1
< 0.1%
9200 1
< 0.1%
8670 1
< 0.1%
8020 1
< 0.1%
8010 1
< 0.1%
8000 1
< 0.1%

sqft_lot
Real number (ℝ)

High correlation 

Distinct9782
Distinct (%)45.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean15106.968
Minimum520
Maximum1651359
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size169.0 KiB
2025-05-31T21:02:20.807356image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum520
5-th percentile1800
Q15040
median7618
Q310688
95-th percentile43339.2
Maximum1651359
Range1650839
Interquartile range (IQR)5648

Descriptive statistics

Standard deviation41420.512
Coefficient of variation (CV)2.7418151
Kurtosis285.07782
Mean15106.968
Median Absolute Deviation (MAD)2618
Skewness13.060019
Sum3.2650689 × 108
Variance1.7156588 × 109
MonotonicityNot monotonic
2025-05-31T21:02:21.605644image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
5000 358
 
1.7%
6000 290
 
1.3%
4000 251
 
1.2%
7200 220
 
1.0%
4800 120
 
0.6%
7500 119
 
0.6%
4500 114
 
0.5%
8400 111
 
0.5%
9600 109
 
0.5%
3600 103
 
0.5%
Other values (9772) 19818
91.7%
ValueCountFrequency (%)
520 1
< 0.1%
572 1
< 0.1%
600 1
< 0.1%
609 1
< 0.1%
635 1
< 0.1%
638 1
< 0.1%
649 2
< 0.1%
651 1
< 0.1%
675 1
< 0.1%
676 1
< 0.1%
ValueCountFrequency (%)
1651359 1
< 0.1%
1164794 1
< 0.1%
1074218 1
< 0.1%
1024068 1
< 0.1%
982998 1
< 0.1%
982278 1
< 0.1%
920423 1
< 0.1%
881654 1
< 0.1%
871200 2
< 0.1%
843309 1
< 0.1%

floors
Real number (ℝ)

High correlation 

Distinct6
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.494309
Minimum1
Maximum3.5
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size169.0 KiB
2025-05-31T21:02:22.251360image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q11
median1.5
Q32
95-th percentile2
Maximum3.5
Range2.5
Interquartile range (IQR)1

Descriptive statistics

Standard deviation0.5399889
Coefficient of variation (CV)0.36136361
Kurtosis-0.48472294
Mean1.494309
Median Absolute Deviation (MAD)0.5
Skewness0.61617672
Sum32296.5
Variance0.29158801
MonotonicityNot monotonic
2025-05-31T21:02:22.850908image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
1 10680
49.4%
2 8241
38.1%
1.5 1910
 
8.8%
3 613
 
2.8%
2.5 161
 
0.7%
3.5 8
 
< 0.1%
ValueCountFrequency (%)
1 10680
49.4%
1.5 1910
 
8.8%
2 8241
38.1%
2.5 161
 
0.7%
3 613
 
2.8%
3.5 8
 
< 0.1%
ValueCountFrequency (%)
3.5 8
 
< 0.1%
3 613
 
2.8%
2.5 161
 
0.7%
2 8241
38.1%
1.5 1910
 
8.8%
1 10680
49.4%

waterfront
Categorical

High correlation  Imbalance 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size1.2 MiB
0
21450 
1
 
163

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters21613
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 21450
99.2%
1 163
 
0.8%

Length

2025-05-31T21:02:23.444833image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2025-05-31T21:02:24.055321image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 21450
99.2%
1 163
 
0.8%

Most occurring characters

ValueCountFrequency (%)
0 21450
99.2%
1 163
 
0.8%

Most occurring categories

ValueCountFrequency (%)
(unknown) 21613
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
0 21450
99.2%
1 163
 
0.8%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 21613
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
0 21450
99.2%
1 163
 
0.8%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 21613
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
0 21450
99.2%
1 163
 
0.8%

view
Categorical

High correlation  Imbalance 

Distinct5
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size1.2 MiB
0
19489 
2
 
963
3
 
510
1
 
332
4
 
319

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters21613
Distinct characters5
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 19489
90.2%
2 963
 
4.5%
3 510
 
2.4%
1 332
 
1.5%
4 319
 
1.5%

Length

2025-05-31T21:02:24.664319image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2025-05-31T21:02:25.323112image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 19489
90.2%
2 963
 
4.5%
3 510
 
2.4%
1 332
 
1.5%
4 319
 
1.5%

Most occurring characters

ValueCountFrequency (%)
0 19489
90.2%
2 963
 
4.5%
3 510
 
2.4%
1 332
 
1.5%
4 319
 
1.5%

Most occurring categories

ValueCountFrequency (%)
(unknown) 21613
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
0 19489
90.2%
2 963
 
4.5%
3 510
 
2.4%
1 332
 
1.5%
4 319
 
1.5%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 21613
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
0 19489
90.2%
2 963
 
4.5%
3 510
 
2.4%
1 332
 
1.5%
4 319
 
1.5%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 21613
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
0 19489
90.2%
2 963
 
4.5%
3 510
 
2.4%
1 332
 
1.5%
4 319
 
1.5%

condition
Categorical

Distinct5
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size1.2 MiB
3
14031 
4
5679 
5
1701 
2
 
172
1
 
30

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters21613
Distinct characters5
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row3
2nd row3
3rd row3
4th row5
5th row3

Common Values

ValueCountFrequency (%)
3 14031
64.9%
4 5679
26.3%
5 1701
 
7.9%
2 172
 
0.8%
1 30
 
0.1%

Length

2025-05-31T21:02:25.998087image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2025-05-31T21:02:26.605199image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
3 14031
64.9%
4 5679
26.3%
5 1701
 
7.9%
2 172
 
0.8%
1 30
 
0.1%

Most occurring characters

ValueCountFrequency (%)
3 14031
64.9%
4 5679
26.3%
5 1701
 
7.9%
2 172
 
0.8%
1 30
 
0.1%

Most occurring categories

ValueCountFrequency (%)
(unknown) 21613
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
3 14031
64.9%
4 5679
26.3%
5 1701
 
7.9%
2 172
 
0.8%
1 30
 
0.1%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 21613
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
3 14031
64.9%
4 5679
26.3%
5 1701
 
7.9%
2 172
 
0.8%
1 30
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 21613
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
3 14031
64.9%
4 5679
26.3%
5 1701
 
7.9%
2 172
 
0.8%
1 30
 
0.1%

grade
Real number (ℝ)

High correlation 

Distinct12
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean7.6568732
Minimum1
Maximum13
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size169.0 KiB
2025-05-31T21:02:27.206928image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile6
Q17
median7
Q38
95-th percentile10
Maximum13
Range12
Interquartile range (IQR)1

Descriptive statistics

Standard deviation1.1754588
Coefficient of variation (CV)0.15351681
Kurtosis1.1909321
Mean7.6568732
Median Absolute Deviation (MAD)1
Skewness0.7711032
Sum165488
Variance1.3817033
MonotonicityNot monotonic
2025-05-31T21:02:27.814883image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=12)
ValueCountFrequency (%)
7 8981
41.6%
8 6068
28.1%
9 2615
 
12.1%
6 2038
 
9.4%
10 1134
 
5.2%
11 399
 
1.8%
5 242
 
1.1%
12 90
 
0.4%
4 29
 
0.1%
13 13
 
0.1%
Other values (2) 4
 
< 0.1%
ValueCountFrequency (%)
1 1
 
< 0.1%
3 3
 
< 0.1%
4 29
 
0.1%
5 242
 
1.1%
6 2038
 
9.4%
7 8981
41.6%
8 6068
28.1%
9 2615
 
12.1%
10 1134
 
5.2%
11 399
 
1.8%
ValueCountFrequency (%)
13 13
 
0.1%
12 90
 
0.4%
11 399
 
1.8%
10 1134
 
5.2%
9 2615
 
12.1%
8 6068
28.1%
7 8981
41.6%
6 2038
 
9.4%
5 242
 
1.1%
4 29
 
0.1%

sqft_above
Real number (ℝ)

High correlation 

Distinct946
Distinct (%)4.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1788.3907
Minimum290
Maximum9410
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size169.0 KiB
2025-05-31T21:02:28.518048image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum290
5-th percentile850
Q11190
median1560
Q32210
95-th percentile3400
Maximum9410
Range9120
Interquartile range (IQR)1020

Descriptive statistics

Standard deviation828.09098
Coefficient of variation (CV)0.46303695
Kurtosis3.4023036
Mean1788.3907
Median Absolute Deviation (MAD)450
Skewness1.4466645
Sum38652488
Variance685734.67
MonotonicityNot monotonic
2025-05-31T21:02:29.323210image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1300 212
 
1.0%
1010 210
 
1.0%
1200 206
 
1.0%
1220 192
 
0.9%
1140 184
 
0.9%
1400 180
 
0.8%
1060 178
 
0.8%
1180 177
 
0.8%
1340 176
 
0.8%
1250 174
 
0.8%
Other values (936) 19724
91.3%
ValueCountFrequency (%)
290 1
< 0.1%
370 1
< 0.1%
380 1
< 0.1%
384 1
< 0.1%
390 2
< 0.1%
410 1
< 0.1%
420 2
< 0.1%
430 1
< 0.1%
440 1
< 0.1%
460 1
< 0.1%
ValueCountFrequency (%)
9410 1
< 0.1%
8860 1
< 0.1%
8570 1
< 0.1%
8020 1
< 0.1%
7880 1
< 0.1%
7850 1
< 0.1%
7680 1
< 0.1%
7420 1
< 0.1%
7320 1
< 0.1%
6720 1
< 0.1%

sqft_basement
Real number (ℝ)

Zeros 

Distinct306
Distinct (%)1.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean291.50905
Minimum0
Maximum4820
Zeros13126
Zeros (%)60.7%
Negative0
Negative (%)0.0%
Memory size169.0 KiB
2025-05-31T21:02:30.036297image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q3560
95-th percentile1190
Maximum4820
Range4820
Interquartile range (IQR)560

Descriptive statistics

Standard deviation442.57504
Coefficient of variation (CV)1.5182206
Kurtosis2.7155742
Mean291.50905
Median Absolute Deviation (MAD)0
Skewness1.5779651
Sum6300385
Variance195872.67
MonotonicityNot monotonic
2025-05-31T21:02:30.789179image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 13126
60.7%
600 221
 
1.0%
700 218
 
1.0%
500 214
 
1.0%
800 206
 
1.0%
400 184
 
0.9%
1000 149
 
0.7%
900 144
 
0.7%
300 142
 
0.7%
200 108
 
0.5%
Other values (296) 6901
31.9%
ValueCountFrequency (%)
0 13126
60.7%
10 2
 
< 0.1%
20 1
 
< 0.1%
40 4
 
< 0.1%
50 11
 
0.1%
60 10
 
< 0.1%
65 1
 
< 0.1%
70 7
 
< 0.1%
80 20
 
0.1%
90 21
 
0.1%
ValueCountFrequency (%)
4820 1
< 0.1%
4130 1
< 0.1%
3500 1
< 0.1%
3480 1
< 0.1%
3260 1
< 0.1%
3000 1
< 0.1%
2850 1
< 0.1%
2810 1
< 0.1%
2730 1
< 0.1%
2720 1
< 0.1%

yr_built
Real number (ℝ)

High correlation 

Distinct116
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1971.0051
Minimum1900
Maximum2015
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size169.0 KiB
2025-05-31T21:02:31.522702image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1900
5-th percentile1915
Q11951
median1975
Q31997
95-th percentile2011
Maximum2015
Range115
Interquartile range (IQR)46

Descriptive statistics

Standard deviation29.373411
Coefficient of variation (CV)0.014902757
Kurtosis-0.6574075
Mean1971.0051
Median Absolute Deviation (MAD)23
Skewness-0.4698054
Sum42599334
Variance862.79726
MonotonicityNot monotonic
2025-05-31T21:02:32.374074image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2014 559
 
2.6%
2006 454
 
2.1%
2005 450
 
2.1%
2004 433
 
2.0%
2003 422
 
2.0%
2007 417
 
1.9%
1977 417
 
1.9%
1978 387
 
1.8%
1968 381
 
1.8%
2008 367
 
1.7%
Other values (106) 17326
80.2%
ValueCountFrequency (%)
1900 87
0.4%
1901 29
 
0.1%
1902 27
 
0.1%
1903 46
0.2%
1904 45
0.2%
1905 74
0.3%
1906 92
0.4%
1907 65
0.3%
1908 86
0.4%
1909 94
0.4%
ValueCountFrequency (%)
2015 38
 
0.2%
2014 559
2.6%
2013 201
 
0.9%
2012 170
 
0.8%
2011 130
 
0.6%
2010 143
 
0.7%
2009 230
1.1%
2008 367
1.7%
2007 417
1.9%
2006 454
2.1%

yr_renovated
Real number (ℝ)

Zeros 

Distinct70
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean84.402258
Minimum0
Maximum2015
Zeros20699
Zeros (%)95.8%
Negative0
Negative (%)0.0%
Memory size169.0 KiB
2025-05-31T21:02:33.178903image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile0
Maximum2015
Range2015
Interquartile range (IQR)0

Descriptive statistics

Standard deviation401.67924
Coefficient of variation (CV)4.7591054
Kurtosis18.701152
Mean84.402258
Median Absolute Deviation (MAD)0
Skewness4.5494934
Sum1824186
Variance161346.21
MonotonicityNot monotonic
2025-05-31T21:02:33.991959image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 20699
95.8%
2014 91
 
0.4%
2013 37
 
0.2%
2003 36
 
0.2%
2005 35
 
0.2%
2007 35
 
0.2%
2000 35
 
0.2%
2004 26
 
0.1%
1990 25
 
0.1%
2006 24
 
0.1%
Other values (60) 570
 
2.6%
ValueCountFrequency (%)
0 20699
95.8%
1934 1
 
< 0.1%
1940 2
 
< 0.1%
1944 1
 
< 0.1%
1945 3
 
< 0.1%
1946 2
 
< 0.1%
1948 1
 
< 0.1%
1950 2
 
< 0.1%
1951 1
 
< 0.1%
1953 3
 
< 0.1%
ValueCountFrequency (%)
2015 16
 
0.1%
2014 91
0.4%
2013 37
0.2%
2012 11
 
0.1%
2011 13
 
0.1%
2010 18
 
0.1%
2009 22
 
0.1%
2008 18
 
0.1%
2007 35
 
0.2%
2006 24
 
0.1%

zipcode
Real number (ℝ)

High correlation 

Distinct70
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean98077.94
Minimum98001
Maximum98199
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size169.0 KiB
2025-05-31T21:02:34.822610image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum98001
5-th percentile98004
Q198033
median98065
Q398118
95-th percentile98177
Maximum98199
Range198
Interquartile range (IQR)85

Descriptive statistics

Standard deviation53.505026
Coefficient of variation (CV)0.00054553579
Kurtosis-0.85347887
Mean98077.94
Median Absolute Deviation (MAD)42
Skewness0.40566121
Sum2.1197585 × 109
Variance2862.7878
MonotonicityNot monotonic
2025-05-31T21:02:35.650933image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
98103 602
 
2.8%
98038 590
 
2.7%
98115 583
 
2.7%
98052 574
 
2.7%
98117 553
 
2.6%
98042 548
 
2.5%
98034 545
 
2.5%
98118 508
 
2.4%
98023 499
 
2.3%
98006 498
 
2.3%
Other values (60) 16113
74.6%
ValueCountFrequency (%)
98001 362
1.7%
98002 199
 
0.9%
98003 280
1.3%
98004 317
1.5%
98005 168
 
0.8%
98006 498
2.3%
98007 141
 
0.7%
98008 283
1.3%
98010 100
 
0.5%
98011 195
 
0.9%
ValueCountFrequency (%)
98199 317
1.5%
98198 280
1.3%
98188 136
 
0.6%
98178 262
1.2%
98177 255
1.2%
98168 269
1.2%
98166 254
1.2%
98155 446
2.1%
98148 57
 
0.3%
98146 288
1.3%

lat
Real number (ℝ)

Distinct5034
Distinct (%)23.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean47.560053
Minimum47.1559
Maximum47.7776
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size169.0 KiB
2025-05-31T21:02:36.499865image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum47.1559
5-th percentile47.3103
Q147.471
median47.5718
Q347.678
95-th percentile47.74964
Maximum47.7776
Range0.6217
Interquartile range (IQR)0.207

Descriptive statistics

Standard deviation0.13856371
Coefficient of variation (CV)0.0029134474
Kurtosis-0.676313
Mean47.560053
Median Absolute Deviation (MAD)0.1049
Skewness-0.48527048
Sum1027915.4
Variance0.019199902
MonotonicityNot monotonic
2025-05-31T21:02:37.316337image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
47.6624 17
 
0.1%
47.5322 17
 
0.1%
47.6846 17
 
0.1%
47.5491 17
 
0.1%
47.6955 16
 
0.1%
47.6886 16
 
0.1%
47.6711 16
 
0.1%
47.5402 15
 
0.1%
47.6842 15
 
0.1%
47.6904 15
 
0.1%
Other values (5024) 21452
99.3%
ValueCountFrequency (%)
47.1559 1
< 0.1%
47.1593 1
< 0.1%
47.1622 1
< 0.1%
47.1647 1
< 0.1%
47.1764 1
< 0.1%
47.1775 1
< 0.1%
47.1776 2
< 0.1%
47.1795 1
< 0.1%
47.1803 1
< 0.1%
47.1808 1
< 0.1%
ValueCountFrequency (%)
47.7776 3
< 0.1%
47.7775 3
< 0.1%
47.7774 1
 
< 0.1%
47.7772 3
< 0.1%
47.7771 2
 
< 0.1%
47.777 2
 
< 0.1%
47.7769 3
< 0.1%
47.7768 2
 
< 0.1%
47.7767 6
< 0.1%
47.7766 4
< 0.1%

long
Real number (ℝ)

High correlation 

Distinct752
Distinct (%)3.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean-122.2139
Minimum-122.519
Maximum-121.315
Zeros0
Zeros (%)0.0%
Negative21613
Negative (%)100.0%
Memory size169.0 KiB
2025-05-31T21:02:38.107104image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum-122.519
5-th percentile-122.387
Q1-122.328
median-122.23
Q3-122.125
95-th percentile-121.979
Maximum-121.315
Range1.204
Interquartile range (IQR)0.203

Descriptive statistics

Standard deviation0.14082834
Coefficient of variation (CV)-0.0011523104
Kurtosis1.0495009
Mean-122.2139
Median Absolute Deviation (MAD)0.101
Skewness0.88505298
Sum-2641408.9
Variance0.019832622
MonotonicityNot monotonic
2025-05-31T21:02:38.899206image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
-122.29 116
 
0.5%
-122.3 111
 
0.5%
-122.362 104
 
0.5%
-122.291 100
 
0.5%
-122.363 99
 
0.5%
-122.372 99
 
0.5%
-122.288 98
 
0.5%
-122.357 96
 
0.4%
-122.284 95
 
0.4%
-122.365 94
 
0.4%
Other values (742) 20601
95.3%
ValueCountFrequency (%)
-122.519 1
 
< 0.1%
-122.515 1
 
< 0.1%
-122.514 1
 
< 0.1%
-122.512 1
 
< 0.1%
-122.511 2
< 0.1%
-122.509 2
< 0.1%
-122.507 1
 
< 0.1%
-122.506 1
 
< 0.1%
-122.505 3
< 0.1%
-122.504 2
< 0.1%
ValueCountFrequency (%)
-121.315 2
< 0.1%
-121.316 1
< 0.1%
-121.319 1
< 0.1%
-121.321 1
< 0.1%
-121.325 1
< 0.1%
-121.352 2
< 0.1%
-121.359 1
< 0.1%
-121.364 2
< 0.1%
-121.402 1
< 0.1%
-121.403 1
< 0.1%

sqft_living15
Real number (ℝ)

High correlation 

Distinct777
Distinct (%)3.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1986.5525
Minimum399
Maximum6210
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size169.0 KiB
2025-05-31T21:02:39.613369image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum399
5-th percentile1140
Q11490
median1840
Q32360
95-th percentile3300
Maximum6210
Range5811
Interquartile range (IQR)870

Descriptive statistics

Standard deviation685.3913
Coefficient of variation (CV)0.34501545
Kurtosis1.5970958
Mean1986.5525
Median Absolute Deviation (MAD)410
Skewness1.1081813
Sum42935359
Variance469761.24
MonotonicityNot monotonic
2025-05-31T21:02:40.304189image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1540 197
 
0.9%
1440 195
 
0.9%
1560 192
 
0.9%
1500 181
 
0.8%
1460 169
 
0.8%
1580 167
 
0.8%
1610 166
 
0.8%
1720 166
 
0.8%
1800 166
 
0.8%
1620 165
 
0.8%
Other values (767) 19849
91.8%
ValueCountFrequency (%)
399 1
 
< 0.1%
460 2
 
< 0.1%
620 2
 
< 0.1%
670 1
 
< 0.1%
690 2
 
< 0.1%
700 2
 
< 0.1%
710 2
 
< 0.1%
720 2
 
< 0.1%
740 8
< 0.1%
750 3
 
< 0.1%
ValueCountFrequency (%)
6210 1
 
< 0.1%
6110 1
 
< 0.1%
5790 6
< 0.1%
5610 1
 
< 0.1%
5600 1
 
< 0.1%
5500 1
 
< 0.1%
5380 1
 
< 0.1%
5340 1
 
< 0.1%
5330 1
 
< 0.1%
5220 1
 
< 0.1%

sqft_lot15
Real number (ℝ)

High correlation 

Distinct8689
Distinct (%)40.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean12768.456
Minimum651
Maximum871200
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size169.0 KiB
2025-05-31T21:02:41.092509image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum651
5-th percentile1999.2
Q15100
median7620
Q310083
95-th percentile37062.8
Maximum871200
Range870549
Interquartile range (IQR)4983

Descriptive statistics

Standard deviation27304.18
Coefficient of variation (CV)2.1384089
Kurtosis150.76311
Mean12768.456
Median Absolute Deviation (MAD)2505
Skewness9.5067432
Sum2.7596463 × 108
Variance7.4551823 × 108
MonotonicityNot monotonic
2025-05-31T21:02:41.857700image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
5000 427
 
2.0%
4000 357
 
1.7%
6000 289
 
1.3%
7200 211
 
1.0%
4800 145
 
0.7%
7500 142
 
0.7%
8400 116
 
0.5%
3600 111
 
0.5%
4500 111
 
0.5%
5100 109
 
0.5%
Other values (8679) 19595
90.7%
ValueCountFrequency (%)
651 1
 
< 0.1%
659 1
 
< 0.1%
660 1
 
< 0.1%
748 2
< 0.1%
750 4
< 0.1%
755 1
 
< 0.1%
757 1
 
< 0.1%
758 1
 
< 0.1%
788 1
 
< 0.1%
794 1
 
< 0.1%
ValueCountFrequency (%)
871200 1
< 0.1%
858132 1
< 0.1%
560617 1
< 0.1%
438213 1
< 0.1%
434728 1
< 0.1%
425581 1
< 0.1%
422967 1
< 0.1%
411962 1
< 0.1%
392040 2
< 0.1%
386812 1
< 0.1%

price_gt_1M
Categorical

High correlation  Imbalance 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size1.2 MiB
0
20121 
1
 
1492

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters21613
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 20121
93.1%
1 1492
 
6.9%

Length

2025-05-31T21:02:42.448645image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2025-05-31T21:02:43.006185image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 20121
93.1%
1 1492
 
6.9%

Most occurring characters

ValueCountFrequency (%)
0 20121
93.1%
1 1492
 
6.9%

Most occurring categories

ValueCountFrequency (%)
(unknown) 21613
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
0 20121
93.1%
1 1492
 
6.9%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 21613
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
0 20121
93.1%
1 1492
 
6.9%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 21613
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
0 20121
93.1%
1 1492
 
6.9%

Interactions

2025-05-31T21:02:05.703116image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:00:10.558155image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:00:18.973276image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:00:27.681348image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:00:36.636575image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:00:44.992761image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:00:53.610489image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:01:01.823883image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:01:09.147543image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:01:16.717783image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:01:24.885323image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:01:33.719366image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:01:42.148689image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:01:50.052576image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:01:57.803058image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:02:06.224796image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:00:11.112273image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:00:19.513726image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:00:28.210376image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:00:37.173834image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:00:45.535056image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:00:54.133410image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:01:02.311768image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:01:09.669239image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:01:17.267141image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:01:25.409177image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:01:34.292519image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:01:42.657331image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:01:50.515194image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:01:58.340645image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:02:06.746138image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:00:11.684878image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:00:20.085878image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:00:28.821462image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:00:37.746157image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:00:46.097231image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:00:54.672085image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:01:02.846430image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:01:10.211147image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:01:17.798857image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:01:25.990039image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:01:34.884817image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:01:43.183197image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:01:51.100931image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:01:58.910076image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:02:08.205930image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:00:12.240831image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:00:20.656877image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:00:29.393515image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:00:38.317569image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:00:46.650611image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:00:55.195214image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:01:03.391913image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:01:10.763912image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:01:18.348379image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:01:26.568773image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:01:35.463167image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:01:43.712209image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:01:51.684443image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:01:59.465836image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:02:08.570267image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:00:12.811329image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:00:21.211391image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:00:30.618439image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:00:38.825739image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:00:47.182345image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:00:55.733987image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:01:03.907542image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:01:11.228946image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:01:18.892869image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:01:27.117496image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:01:36.047086image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:01:44.240889image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:01:52.220587image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:01:59.988065image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:02:09.077346image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:00:13.370242image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:00:21.765728image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:00:31.160119image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:00:39.343269image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:00:47.694898image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:00:56.031134image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:01:04.390333image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:01:11.749011image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:01:19.477189image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:01:27.627806image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:01:36.575831image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:01:44.762700image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:01:52.726239image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:02:00.510009image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:02:09.567352image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:00:13.939879image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:00:22.323402image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:00:31.700095image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:00:39.873240image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:00:48.183811image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:00:57.190962image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:01:04.823493image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:01:12.064476image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:01:20.024442image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:01:28.837985image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:01:37.119194image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:01:45.276059image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:01:53.232473image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:02:01.015760image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:02:10.029109image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:00:14.481744image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:00:22.830571image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:00:32.207236image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:00:40.366847image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:00:48.716798image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:00:57.666089image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:01:05.219961image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:01:12.534133image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:01:20.536457image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:01:29.305961image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:01:37.651221image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:01:45.766628image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:01:53.722494image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:02:01.412441image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:02:10.541494image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:00:15.053970image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:00:23.738417image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:00:32.746681image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:00:40.890516image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:00:49.258999image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:00:58.172790image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:01:05.719878image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:01:13.038641image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:01:21.078968image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:01:29.818188image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:01:38.195844image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:01:46.306680image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:01:54.244630image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:02:01.886973image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:02:11.067910image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:00:15.627174image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:00:24.326710image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:00:33.285874image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:00:41.450880image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:00:49.851028image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:00:58.711699image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:01:06.220893image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:01:13.469695image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:01:21.537226image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:01:30.370817image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:01:38.789581image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:01:46.845942image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:01:54.671095image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:02:02.435560image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:02:11.626333image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:00:16.246249image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:00:24.914065image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:00:33.873302image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:00:42.034196image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:00:50.476581image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:00:59.250829image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:01:06.815060image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:01:14.042933image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:01:22.154297image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:01:30.941883image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:01:39.407794image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:01:47.413911image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:01:55.118087image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:02:03.012442image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:02:12.197630image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:00:16.849703image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:00:25.530179image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:00:34.460407image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:00:42.636901image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:00:51.117249image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:00:59.837101image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:01:07.209135image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:01:14.624396image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:01:22.739257image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:01:31.560217image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:01:39.967812image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:01:48.004667image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:01:55.699683image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:02:03.598483image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:02:12.755731image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:00:17.421869image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:00:26.084209image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:00:35.031189image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:00:43.239888image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:00:51.755939image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:01:00.352491image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:01:07.722035image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:01:15.166064image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:01:23.289874image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:01:32.114260image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:01:40.532222image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:01:48.515308image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:01:56.268754image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:02:04.168275image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:02:13.266505image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:00:17.959955image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:00:26.622188image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:00:35.586757image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:00:43.831440image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:00:52.381795image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:01:00.839625image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:01:08.228261image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:01:15.708965image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:01:23.853961image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:01:32.676674image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:01:41.103443image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:01:49.048505image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:01:56.774562image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:02:04.689930image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:02:13.744102image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:00:18.466871image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:00:27.159458image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:00:36.125971image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:00:44.429500image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:00:53.134676image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:01:01.357579image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:01:08.679945image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:01:16.217151image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:01:24.398700image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:01:33.227561image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:01:41.630275image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:01:49.547419image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:01:57.312667image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2025-05-31T21:02:05.195935image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2025-05-31T21:02:43.501732image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
bathroomsbedroomsconditionfloorsgradelatlongprice_gt_1Msqft_abovesqft_basementsqft_livingsqft_living15sqft_lotsqft_lot15viewwaterfrontyr_builtyr_renovatedzipcode
bathrooms1.0000.5210.1300.5470.6580.0080.2620.4470.6910.1920.7460.5700.0690.0630.1140.1020.5670.043-0.205
bedrooms0.5211.0000.0240.2280.381-0.0210.1910.1980.5400.2300.6470.4440.2170.2020.0380.0000.1800.017-0.167
condition0.1300.0241.0000.1790.1540.0580.0810.0520.1070.0940.0600.0620.0390.0130.0250.0170.2480.0670.074
floors0.5470.2280.1791.0000.5020.0250.1490.1860.599-0.2720.4010.305-0.234-0.2310.0240.0220.5520.013-0.061
grade0.6580.3810.1540.5021.0000.1040.2230.5870.7120.0930.7160.6630.1520.1560.1430.1180.5010.016-0.182
lat0.008-0.0210.0580.0250.1041.000-0.1430.268-0.0260.1160.0310.028-0.122-0.1170.0680.034-0.1260.0250.250
long0.2620.1910.0810.1490.223-0.1431.0000.1180.385-0.2000.2850.3800.3710.3730.0850.0960.413-0.075-0.577
price_gt_1M0.4470.1980.0520.1860.5870.2680.1181.0000.4650.2860.5490.4710.0300.0100.3590.1970.0850.1070.131
sqft_above0.6910.5400.1070.5990.712-0.0260.3850.4651.000-0.1660.8440.6970.2720.2540.0890.0830.4720.031-0.279
sqft_basement0.1920.2300.094-0.2720.0930.116-0.2000.286-0.1661.0000.3280.1300.0370.0310.1590.134-0.1780.0630.115
sqft_living0.7460.6470.0600.4010.7160.0310.2850.5490.8440.3281.0000.7470.3040.2840.1490.1400.3520.053-0.207
sqft_living150.5700.4440.0620.3050.6630.0280.3800.4710.6970.1300.7471.0000.3600.3660.1470.0890.336-0.006-0.287
sqft_lot0.0690.2170.039-0.2340.152-0.1220.3710.0300.2720.0370.3040.3601.0000.9220.0400.014-0.0380.009-0.319
sqft_lot150.0630.2020.013-0.2310.156-0.1170.3730.0100.2540.0310.2840.3660.9221.0000.0350.000-0.0160.009-0.326
view0.1140.0380.0250.0240.1430.0680.0850.3590.0890.1590.1490.1470.0400.0351.0000.5920.0410.1090.074
waterfront0.1020.0000.0170.0220.1180.0340.0960.1970.0830.1340.1400.0890.0140.0000.5921.0000.0320.0920.079
yr_built0.5670.1800.2480.5520.501-0.1260.4130.0850.472-0.1780.3520.336-0.038-0.0160.0410.0321.000-0.215-0.317
yr_renovated0.0430.0170.0670.0130.0160.025-0.0750.1070.0310.0630.053-0.0060.0090.0090.1090.092-0.2151.0000.062
zipcode-0.205-0.1670.074-0.061-0.1820.250-0.5770.131-0.2790.115-0.207-0.287-0.319-0.3260.0740.079-0.3170.0621.000

Missing values

2025-05-31T21:02:14.495549image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2025-05-31T21:02:15.986548image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

bedroomsbathroomssqft_livingsqft_lotfloorswaterfrontviewconditiongradesqft_abovesqft_basementyr_builtyr_renovatedzipcodelatlongsqft_living15sqft_lot15price_gt_1M
031.00118056501.0003711800195509817847.5112-122.257134056500
132.25257072422.000372170400195119919812547.7210-122.319169076390
221.00770100001.000367700193309802847.7379-122.233272080620
343.00196050001.000571050910196509813647.5208-122.393136050000
432.00168080801.0003816800198709807447.6168-122.045180075030
544.5054201019301.00031138901530200109805347.6561-122.00547601019301
632.25171568192.0003717150199509800347.3097-122.327223868190
731.50106097111.0003710600196309819847.4095-122.315165097110
831.00178074701.000371050730196009814647.5123-122.337178081130
932.50189065602.0003718900200309803847.3684-122.031239075700
bedroomsbathroomssqft_livingsqft_lotfloorswaterfrontviewconditiongradesqft_abovesqft_basementyr_builtyr_renovatedzipcodelatlongsqft_living15sqft_lot15price_gt_1M
2160332.50227055362.0003822700200309806547.5389-121.881227057310
2160432.00149011263.0003814900201409814447.5699-122.288140012300
2160542.50252060232.0003925200201409805647.5137-122.167252060230
2160643.50351072002.000392600910200909813647.5537-122.398205062001
2160732.50131012942.000381180130200809811647.5773-122.409133012650
2160832.50153011313.0003815300200909810347.6993-122.346153015090
2160942.50231058132.0003823100201409814647.5107-122.362183072000
2161020.75102013502.0003710200200909814447.5944-122.299102020070
2161132.50160023882.0003816000200409802747.5345-122.069141012870
2161220.75102010762.0003710200200809814447.5941-122.299102013570

Duplicate rows

Most frequently occurring

bedroomsbathroomssqft_livingsqft_lotfloorswaterfrontviewconditiongradesqft_abovesqft_basementyr_builtyr_renovatedzipcodelatlongsqft_living15sqft_lot15price_gt_1M# duplicates
4431.00108062501.0002510800195009816847.5045-122.3301070625003
010.7584072031.500368400194909816847.4756-122.3011560860302
111.0062082611.000356200193909810647.5138-122.3641180824402
211.0085080501.000268500190609811847.5427-122.2881590518002
311.0090063801.000369000194709812547.7019-122.3111830638002
412.00115098121.0004711500196209800147.2951-122.2841200981202
521.0058075001.000355800194309817847.4852-122.25117001125002
621.0070048001.000377000192209812247.6147-122.3001440480002
721.0079071531.000467900194409816847.4869-122.324810712802
821.00790112341.000467900194209816647.4413-122.34919301187102